Generating derivative structures: Algorithm and applications 
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We present an algorithm for generating all derivative superstructures — for arbitrary parent struc- 
tures and for any number of atom types. This algorithm enumerates superlattices and atomic 
configurations in a geometry-independent way. The key concept is to use the quotient group asso- 
ciated with each superlattice to determine all unique atomic configurations. The run time of the 
algorithm scales linearly with the number of unique structures found. We show several applications 
demonstrating how the algorithm can be used in materials design problems. We predict an alto- 
gether new crystal structure in Cd-Pt and Pd-Pt, and several new ground states in Pd-rich and 
Pt-rich binary systems. 



I. WHY DERIVATIVE STRUCTURES? 

Derivative superstructurea^' play an important role in 
different materials phenomenon such as chemical order- 
ing in alloys, spin ordering in magnets, and vacancy or- 
dering in non-stoichiometric materials. Similarly, deriva- 
tive super lattice J23BI are important in problems such as 
twinning. What is a derivative superstructure? A deriva- 
tive superstructure is a structure whose lattice vectors are 
multiples of those of a 'parent lattice' and whose atomic 
basis vectors correspond to lattice points of the parent 
lattice. Many structures of intermetallic compounds can 
be classified as fcc-derived superstructures. These super- 
structures have atomic sites that closely correspond to 
the sites of an fee lattice but some of the translational 
symmetry is broken by a periodic arrangement of dif- 
ferent kinds of atoms. The structures shown in Fig. [2] 
comprise the set of all fcc-derived binary superstructures 
with unit cell sizes 2, 3, and 4 times larger than the par- 
ent lattice. 

Large sets of derivative superstructures are often used 
in (practically) exhaustive searches of binary configura- 
tions on a lattice to determine ground state properties 
of intermetallic systems. The approach is not limited to 
searches of configurational energies, but other physical 
observables can also be targeted if an appropriate Hamil- 
tonian is available. For example, Kim et al\^ used an 
empirical pseudopotential Hamiltonian and a large list of 
derivative superstructures to directly search semiconduc- 
tor alloys for desirable band-gaps and effective masses. 
The set of derivative superstructures is useful in any sit- 
uation where the physical observable of interest depends 
on the atomic configuration. 

For the aforementioned reasons, an algorithm for sys- 
tematically generating all superstructures of a given par- 
ent structure is useful. Such an algorithm has been pre- 
sented in the literature only once^ (FWZ below), but 
closely related algorithms ha ve be en implemented in sev- 
eral alloy modeling packages .I^EEl The FWZ algorithm is 
restricted to fee- and bcc-based superstructures and to 
binary cases only. Furthermore, the list generated by the 
FWZ algorithm is formally incomplete (though in prac- 



FIG. 1: An example parent lattice (left) and a superstructure 
(right). The parent lattice is fee and the superlattice is defined 
by the (doubled) unit cell outlined in gray. The two interior 
points of the superlattice are occupied by one black atom 
and one gray atom. The superlattice -I- atoms constitute a 
derivative structure. The superstructure of this example is 
that of CuAu. 

tice it may be sufficient).'^ 

The purpose of this paper is to present a general al- 
gorithm that generates a formally complete list of two- 
or three-dimensional superstructures, and that works for 
any parent lattice and for arbitrary fc-nary systems (bi- 
nary, ternary, etc.). This algorithm is conceptually dis- 
tinct from FWZ and related implementations. Instead 
of using a geometrical, 'smallest first' approach to the 
enumeration,'Sl it takes advantage of known group theo- 
retical properties of integer matrices. The algorithm is 
orders of magnitude faster than FWZ, more general, and 
formally complete. A Fortran 95 implementation of the 
algorithm is included with this paper as supplementary 
material. 

Mathematically, we can describe the purpose of the 
algorithm as this: for a given parent lattice, enumer- 
ate all possible superlattices and all rotationally- and 
translationally-unique 'colorings' or labelings of each su- 
perlattice. In presenting the algorithm in the next sec- 
tion, we shall refer to superlattices and labelings rather 
than referring to crystal structures or atomic sites. 



II. 



ENUMERATING ALL DERIVATIVE 
STRUCTURES 



Here is a brief outline of the algorithm. 
1. For each superlattice of size n, generate all Hermite 
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FIG. 2: The first 17 binary structures de- 
rived from tlie fee lattice. All have 4 
atoms/cell or less. Structures shown with a 
green plane can be characterized as a stack- 
ing of pure A and B atomic layers. For ex- 
ample the Llo structure (upper left) is an al- 
ternating (AiBi) sequence of layers stacked 
in the [001] direction. All of the 2- and 3- 
atoms/cell structures have physical manifes- 
tations. Of the 4-atoms/cell structures only 
four have physical manifestations. Three of 
the others (yellow backgrounds) have been 
predicted to exislP^ but not yet observed. 
The other five (purple backgrounds) have 
never been observed or predicted to exist in 
any system. 





N=4 



L1. 



v/v:PJO 



iaa:| |ni| |i:l| i^jjii |l:j 



■Kbh' 




Hn 



(Ayzr; 



;ptaTi) 




Normal Form (HNF) matrices.'^ (In what follows, 
we refer to n as the index of the superlattice.) 

2. Use the symmetry of the parent lattice to remove 
rotationally-equivalent superlattices, thus shrink- 
ing the list of HNF matrices. 

3. For each superlattice index n, find the Smith Nor- 
mal Form (SNF) of each HNF in the list, and: 

(a) Generate a list of possible labelings (atomic 
configurations) for each SNF, essentially a list 
of all fc" numbers in a base k, n-digit system. 
For the labels, we use the first k letters of the 
alphabet, a,b, ■ ■ ■ . 

(b) Remove incomplete labelings where each of 
the k labels (a, 5, • • • ) does not appear at least 
once. 

(c) Remove labelings that are equivalent under 
translation of the parent lattice vectors. This 
reduces the list of labelings by a factor of w n. 

(d) Remove labelings that are equivalent under an 
exchange of labels, i.e., a ^ &, so that, e.g., 
the labeling aabbaa is removed from the list 
because it is equivalent to bbaabb. 

(e) Remove labelings that are super-periodic, i.e., 
labelings that correspond to a non-primitive 
superstructure. This can be done without us- 
ing the geometry of the superlattice. 

4. For each HNF, remove labelings that are permuted 
by symmetry operations (of the parent lattice) that 
leave the superlattice fixed. 

An important feature of the algorithm is that the the 
list of possible labelings, generated in step |3a| form a min- 
imal hash table with a perfect hash function. Eliminating 
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FIG. 3: Required CPU time as a function of the number of 
unique derivative superstructures found. The scaling is linear, 
the best possible scaling for this type of problem. 



all duplicate labelings in a list of N can be accomplished 
in 0{N) time. Coupled with the group theoretical ap- 
proach, this results in an extremely efficient algorithm, 
orders of magnitude faster than FWZ, and which scales 
linearly. That is, the time to find N unique structures 
scales linearly with N, regardless of the size of N. This 
linear scaling is shown for the case of binary superstruc- 
tures of an fee parent lattice in Fig. |3] 



A. Generating all superlattices 

Given a 'parent' cell (any lattice), the first step in find- 
ing all derivative structures of that cell is to enumerate 
all derivative superZattices. Consider the transformation 
B = AH where A is the basis for the original lattice (the 



basis vectors listed column- wise), H is a matrix with all 
integer elements, and B is the matrix of the transformed 
lattice. If the determinant of the transformation matrix, 
|IHI|, is ±1, then IH is merely a change of basis that leaves 
the lattice represented by A unchanged. Matrices A and 
B are merely two different choices of basis for the same 
lattice. On the other hand, if the elements of EI are all in- 
tegers but the determinant of HI is 2, say, then the lattice 
of B is a superlattice^l of (i.e., a subgroup) of the lattice 
defined by A, but with twice the volume of the original 
(parent) lattice. 

Two different matrices. Hi and IHI2, with the same de- 
terminant, will generate different bases for the same lat- 
tice if and only if Hi can be reduced to H2 by elemen- 
tary integer column operations. The canonical form for 
such operations is lower triangular Hermite Normal Form 
(HNF). Thus, if we use only matrices H which are in 
HNF, we will produce exactly one representation of each 
superlattice.'^ * In three dimensions, the lower-triangular 
Hermite Normal Form is 




with 



0<b<c, 0<d,e< f. 



(1) 

In this form, the product of the integers on the diagonal 
alone, ax ex /, fixes the determinant. Again, we refer 
to the superlattice size, or the determinant, as the index 
n. Generating all HNF matrices of a given index can be 
done then by finding each unique triplet, acf — |H|, and 
then generating all values of b, d, and e that obey the 
conditions in ^. 

The algorithm for generating all possible HNF matrices 
of a given index |H| is rather simple, comprising just two 
steps. In the first step, find all possible diagonals: find 
all values a, 1 < a < |H|, that evenly divide |H|; for each 
of these values, find all c, 1 < c < |H|/a, that evenly 
divide |H|/a. For each value of c, let / — |H|/(ac). For 
example, consider the case of |H| = 6. We execute two 
nested loops over the possible values of a and c; each loop 
runs over all integers between 1 and the |H|, testing the 
above conditions at each iteration. The loops run from 
1 to 6, and the algorithm finds eight cases that meet the 
above conditions. They are: 



a 


1 1 


1 


2 2 3 3 6 


c 


1 2 


3 


13 12 1 


f 


6 3 


2 


3 12 11 



The set of acf triplets generated during this first step 
comprises all possible diagonals of the HNF matrices for 
the case of n == |H| = 6. The second step, generating 
each set of values of b, d, and e for each diagonal (set 
of acf triplets), can be accomplished simply by three 
nested loops that start at zero and terminate at 6 < c 
and d,e < f. 

As an example of both steps, consider the case where 
the index is merely double that of the original lattice, i.e., 
where |H| = 2. The factors of 2 are just the set {1, 2}, so 



the first step finds only three cases: (2,1,1), (1,2,1), and 
(1,1,2). Then, generating the off-diagonal terms for each 
of these three cases, we find seven HNF matrices: 



/ 1 



case 2: 




2 

case 1: I 1 

1 



1 

case 3: I 1 

2 



For increasing index, n = |H| = 1, 2, 3, • • • , the number 
of HNF matrices generates an interesting sequence: 1,7, 
13, 35, 31, 91, • • • . This sequence appears in Sloane's 
databasd^ as AOOIOOI. The closed- form expression for 
n-th term in the series is 




^da(d) = n 



(pr^-i)(pr^-i) 

(p,-l)2(p, + l) 



(2) 



where a is the sum of divisors function, and where the pi 
and ei are the prime factors and powers oi n: n = p\^ ■ 

jhy 



P'2 ' ' ' Pil ^- Th e sequence appears in the cryst a,llograpI: 
literaturel^IHl as well as several other contexts. ^^MlZIi 

Significantly, because we have an expression for the 
number of superlattices, the implementation of the HNF- 
generating algorithm can be rigorously checked. Also 
note that this step of the algorithm is independent of the 
choice of parent lattice. 



B. Reducing HNF list by parent lattice symmetry 

The set of HNF matrices defines the set of all deriva- 
tive superlattices of a parent cell via the transformation 
mentioned above, B = AH. However, not all of the super- 
lattices in this set will be geometrically different. Some 
distinct lattices will be equivalent under symmetries of 
the parent lattice, illustrated in the example below. 

Such duplicate superstructures must be eliminated by 
the algorithm. At the end of the algorithm we want all 
derivative structures to be unique from a materials point 
of view. So we wish to exclude from the list any super- 
structures that are related to others already in the list 
simply by a rotation, reflection or change of basis. 

As an illustration, consider a two-dimensional par- 
ent lattice that is square, that is, A = I (the two- by- 
two identity matrix). There are three HNF matrices 
for which |H| = 2 and three corresponding superlattices, 

B = AH = IH = H: 






L 



V. 



The parent lattice itself is indicated by dots (filled and 
unfilled) while the superlattice is indicated by filled dots. 
The vectors defined by the matrices are shown as ar- 
rows. The first two lattices are clearly equivalent under 
a 90° rotation, one of the eight symmetry operations of 
a square lattice. 

To enumerate the distinct super lattices of a given index 
n then, we must check that each new superlattice that is 
added to the hst is not a rotated duplicate of a previous 
superlattice. More precisely, we must check that each 
new basis B^ is not equivalent, under change-of-basis, to 
some symmetric image MBj of a basis Bj already in the 
list. In other words, we want to avoid the relation B^ = 
MBjH where B^ is a candidate superlattice, M is any of 
the rotations of the parent lattice, Bj is a superlattice 
already in the list of distinct superlattices, and EI is any 
unimodular matrix of integers. (Since B^ and Bj have 
the same determinant, we will only need to check that 
B^"'"M^-'-Bi is a matrix of integers.) 

For the case of cubic symmetry, the seven superlattices 
for the EI = 2 case mentioned above reduce to only two 
symmetrically-distinct superlattices. The corresponding 
derivative superstructures are LIq and Lli, both well- 
known structures in intermetallic compounds. The fact 
that these are the only two 2-atom/cell fee structures is 
not coincidence or an accident of chemistry; no other 
2-atom/cell structures are possible geometrically. The 
hierarchy of physically-observed structures uncovered for 
fee and bcc lattices as the index is increased is discussed 
in Refs. 19f20i Additional applications can be found in 
SecIVl 



C. Find the unique labelings for all superlattices 

1. Generate all possible labelings 

For each HNF, each superlattice, we start by generat- 
ing all possible labelings of that superlattice. In other 
words, given k colors (types of atoms), represented by 
the labels a,b,- ■ ■ , we generate all possible ways of la- 
beling (coloring) the superlattice. Each HNF matrix of 
determinant size n represents a superlattice with n inte- 
rior points to be decorated. If the number of colors is fc, 
then the list of all possible labelings is easily represented 
by the list of all n-digit, base-fc numbers. So, from a com- 
binatorial point of view, there are k" distinct labelings. 
For example, in the case of a binary system (fc — 2) with 
4 interior points (index n = 4), there are 2^ = 16 possible 
labelings (see Table IT|. 



aaaa 
aaab 
aaba 
aabb 


abaa 
abab 
abba 
abbb 




baaa 
baab 
baba 
babb 



bbaa 
bbab 
bbba 
bbbb 



TABLE I: An example labeling for the binary case, k — 2, 
with 4 interior points, n = 4, in the superlattice. There are 
k" = 2"^ — 16 distinct labelings, but the colored labelings 
represent incomplete or duplicate superstructures. Yellow 
labelings are incomplete (not all labels are present), purple 
are translation duplicates, blue are label-exchange duplicates, 
and green are super-periodic labelings. 
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FIG. 4: (I) One-dimensional example of a parent lattice, 
(II) a derivative superlattice (index n = 4), (III) one possible 
superlattice labeling. 



2. Concept of eliminating duplicate labelings 

The rest of the algorithm deals with just one concep- 
tual issue — given the fc" labelings/colorings of the su- 
perlattice, eliminate the duplicates. In the FWZ algo- 
rithm and its extended implementations, duplicate struc- 
tures are eliminated by directly comparing^ ^ one can- 
didate structure to another geometrically, necessitating 
an expensive 0{N^) search. We eliminate the dupli- 
cates via group theory rather than checking the struc- 
tures themselves. Although this approach is more ab- 
stract than the geometric approach, it is much more 
efficient — eliminating the duplicates in a list becomes a 
strictly 0{N) procedure. 

One- dimensional example: We start with a simple il- 
lustration and then discuss the essential group theoreti- 
cal concepts in the context of that example. Consider the 
one-dimensional case of Fig. El The first line (I) is a par- 
ent lattice, an infinite collection of equally-spaced points, 
identified with the set of integers, denoted Z. The second 
line (II) is a superlattice, a subset of the parent lattice 
(every fourth point; those colored black). The third line 
(III) is a superstructure, a 'labeling' or 'coloring' of the 
parent lattice that has the same periodicity as the super- 
lattice. The points of the lattice play the role of positions 
in a crystal, and the colors/labels play the role of atoms 
placed at those positions. 

There are labelings that are distinct yet physically 
equivalent, as shown in Fig. [5] Note that line (II) is ob- 
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FIG. 5: Three labelings of the superlattice of Fig. HI Lines 
II and III are identical to line I except the colors (labels) are 
shifted. In line II the colors have been shifted two units to 
the right. In line III, the shift has been effected by translating 
the lattice two units to the left. 



taincd from line (I) by shifting the colors two units to the 
right, and line (III) is obtained from line (I) by shifting 
the numbers two units to the left — with the same result. 
Lines (II) and (III) are the same labeling, obtained in 
different ways from (I) . Both are physically equivalent to 
(I). The fact that we can obtain such a shifted labeling 
either by shifting the numbers or by shifting the labels 
explains why we can do much of our equivalence checking 
within a finite group, instead of geometrically within an 
infinite lattice. By this method, we will identify these 
equivalent labelings and remove them from the original 
list of fc" labelings. 

We may illustrate the group theory approach using 
Fig. |4] The parent lattice (I) is the set of integers Z, 
which is a group under the addition operation. We refer 
to this group as L. The superlattice is the set of mul- 
tiples of 4, denoted 4Z. We refer to this subgroup of L 
as S. We label the parent lattice L in a manner which 
is periodic with respect to the superlattice S, and note 
that if two points differ by an element of the superlattice, 
they must receive the same label. We use colors as labels 
in line (III) of Fig. H and note that every fourth point 
has the same color. 

Notice that the green points are our superlattice are 
4Z, and the yellow points are a copy of 4Z, but translated 
one unit to the right. Thus, we may denote the latter 
set (the yellow points) by the set 1 + 4Z. Similarly, the 
red points are the set 2 + 4Z, and the blue points are 
3 + 4Z. These four sets, 4Z, 1 + 4Z, 2 + 4Z, 3 + 4Z, are 
mutually disjoint (they don't overlap), and their union is 
the entire parent lattice L. They are translations of S; 
and thus are the cosets of the subgroup S. This means 
we can use them to form a new group, called the quotient 
group G = L/S (see Table [ll| . This new group is finite, 
having only four elements. For notational convenience we 
shall also refer to these 4 elements of G as (0,1,2,3). We 
need only label the four elements of our quotient group 
in order to label the entire parent lattice. 

Suppose we wish to translate a labeling (in order to 
identify and eliminate equivalent structures). As shown 
in line (I) of Fig. [5] we have labeled the elements of the 
quotient group as follows (using g, y, r and b for the 





Z 1 + Z2 + Z.3 + Z 


z 


Z 1+Z2+Z3+Z 


1 + Z 


1 + Z 2 + Z 3 + Z Z 


2 + Z 


2+Z3+Z Z 1+Z 


3 + Z 


3+Z Z 1+Z2+Z 





12 3 





12 3 


1 


12 3 


2 


2 3 1 


3 


3 12 



TABLE II: Two representations of the Cayley table for the 
quotient group G (cosets of the subgroup S). On the right, 
the elements of the group have been denoted (0,1,2,3) for 
notational convenience. 



colors): 






4Z - 


^ 9 


I 


I + 4Z - 


-^ y 


2 


2 + 4Z - 


~> r 


3 


3 + 4Z - 


^ b. 



In Fig. [5] we see that translating the labels by two is the 
same as simply adding 2 to each coordinate, thus: 



4Z - 


-^ 2 + 4Z 




- 


^ 2 


1 + 4Z - 


-^ 3 + 4Z 




I - 


^ 3 


2 + 4Z - 


-> 4 + 4Z = 


= 4Z 


2 - 


^ 


3 + 4Z - 


^ 5 + 4Z = 


= 1+4Z 


3 - 


-> 1 



The effect is the same as if we had assigned the colors 
differently: 






4Z - 


^ r 


1 


1 + 4Z - 


-> b 


2 


2 + 4Z - 


■^ 9 


3 


3 + 4Z - 


■^ V- 



Translating the lattice by adding +2 to every point (mov- 
ing the origin by 2 units) has the same effect on the la- 
beling as if we had merely labeled the four elements of 
the quotient group, and then added +2 to every element 
of the group, producing the permutation — > 2, I — > 3, 
2 ^ and 3 ^ 1, denoted (2,0,3,1). 

Instead of determining that two labelings of the (infi- 
nite) lattice are equivalent by translation, we may simply 
check that the corresponding labelings of our finite quo- 
tient group G = Z4 are equivalent. We do this by just 
adding a fixed element to every element in the group, ef- 
fecting a permutation of the cyclic group Z4. This idea — 
of labeling the quotient group instead of the lattice ele- 
ments, and checking equivalence within the group instead 
of by translating the lattice — may seem unduly abstract 
and unnecessary in one dimension, but it becomes much 
more efficient and crucial in higher dimensions, as we now 
show. 

Application to higher dimensions: In any dimension, 
we have a parent lattice L, and a superlattice S which is a 
subgroup of L. Labeling L in a manner which is periodic 
with respect to S is equivalent to merely labeling the 
elements of the quotient group G = L/S. Note that even 
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FIG. 6: A superlattice whose SNF is non-trivial (non-cyclic). 
Because the quotient group of the superlattice is non-cyclic, 
translations of the lattice are not cyclic permutations. Instead 
the sites are permuted in groups of two or four with each 
translation (rather than permuting as a single group of 8). 
In the figure, three different pictures of the same superlattice 
are shown. In each case a different origin is chosen. In the 
second case, the translation permutes two groups of 4 sites, 
in the third, 4 pairs are permuted. 



though L and S are infinite sets, their quotient group is 
always a finite group with the same number of elements as 
the superlattice index n. Again, we check for equivalence 
by doing operations within the group instead of by lattice 
translation. 

The key to this approach is the Smith normal form 
(SNF). The SNF is useful because it provides the quo- 
tient group directly as follows. Recall that if A is a basis 
for L, then the distinct lattices of index n are uniquely 
characterized by bases B — AH, where H is a matrix of 
determinant n in HNF. If 5" is given by one such basis 
Bi = AHi, then the quotient group G = L/S can be 
found by converting the matrix Hi into SNF (which is a 
diagonal matrix with certain special properties; see Ap- 
pendix). In higher dimensions the quotient group may 
not be purely cyclic, but it is a direct sum of cyclic 
groups which are given by the diagonal entries in the 
SNF matrix (see Fig.[6|. For example if the SNF matrix 

2 \ 

, then the quotient group G = L/S is the direct 




In relation to the algorithm, there are two important 
facts to note about the SNF. (i) The SNF provides the 
quotient group directly, which in turn is the key to im- 
plementing an 0{N) algorithm, (ii) The number of SNFs 
(and so quotient groups) is small compared to the number 
of distinct lattices of index n (see Table III I . This means 
that translation duplicates can be removed from the fc" 
list for hundreds or thousands of different superlattices 
simultaneously. (The surprising geometric implications 
of this are discussed in the appendix.) This reduces the 
running time by many orders of magnitude. 



n 


2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 


HNFs 


7 13 35 31 91 57 155 130 217 133 455 183 399 403 651 


SNFs 


112111321121114 



TABLE III: Table showing the number of Hermite normal 
form (HNF) matrices and Smith normal form (SNF) matrices 
as a function of index n (determinant size). The number of 
HNFs is a rapidly-increasing function of n (see Eq. [2| whereas 
the number of SNFs grows very slowly. 



3. Eliminating translation duplicates 

Because of its periodicity, the choice of origin of a su- 
perlattice is arbitrary. A change in origin implies a per- 
mutation of the labels that nonetheless defines the same 
superstructure (compare lines I and III in Fig. [sl. As 
stated previously, by examining the quotient group in- 
stead of directly comparing the structures, the duplicate 
labelings can be readily identified. For example, consider 
the case for n = 4. Adding each member to the quotient 
group Z4 — (0, 1, 2, 3) effects 4 permutations as follows: 



mem 


ber mapping 


permutation 



1 
2 
3 


0^ 0, 1^ 1, 2^ 2, 3- 
0-^ 1 -> 2 -> 3 -> 
0^ 2 ^ 0, 1^ 3 ^ 1 
0^ 3 ^ 2 ^ 1 ^ 


. 3 (0,1,2,3) 
(1,2,3,0) 
(2,0,3,1) 
(3,0,1,2) 



If we take the 14 complete labelings of Table |T] and the 
three non-trivial permutations above, we find that 10 
are duplicates (colored purple in Table III: 



(0,1,2,3) 


(1,2,3,0) (2,0,3,1) (3,0,1,2) 






duplicates 


aaab 


aaba 


abaa baaa 


aabb 


abba 


bbaa baab 


abab 


baba 


abab baba 


abbb 


bbba 


bbab babb 



Of the original 2* = 16 labelings, two were discarded 
immediately because they were incomplete. Of the re- 
maining 14, 10 are translation duplicates, leaving 4 that 
are translationally inequivalent (left column above). 



4. Remove 'label- exchange' duplicates 

The next step in the algorithm is to remove labelings 
that are equivalent under exchange of labels. Struc- 
turally, there is no difference between a superlattice 
whose interior points are labeled aaab versus bbba. Al- 
though the energy of an isostructural compound with 
composition X3Y1 is different from one with composi- 
tion X1Y3, we only wish to include one entry in our list 
of derivative superstructures because the full composi- 
tion list can always be recovered by making all possible 
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FIG. 7: (color on-line) (I) One-dimensional example of a 
parent lattice, (II) a derivative superlattice (index n = 4), 
(III) a super-periodic (or non-primitive) labeling. Although 
the index of the superlattice is n = 4, the structure can be 
represented by a superlattice labeling of period 2 instead of 
4. The superstructure of line III would have been found as an 
index n = 2 derivative structure and is therefore a duplicate. 



FIG. 8; (color on-line) Two identical 4x4 superlattices 
(dotted lines) with rotationally-equivalent labelings (red and 
blue circles). Although the superlattices themselves are un- 
changed, a 90° rotation applied to the left labeling yields that 
shown on the right. Thus the second is a duplicate of the first 
and should be removed from the list of labelings. 



6. For each HNF: remove 'label-rotation' duplicates 



label exchanges (i.e., a ^ b). In the example above, four 
labelings were unique under translations: 



aaab 
aabb 
abab 
abbb 



But the first and the fourth are equivalent by exchanging 
a ^ b and applying the permutation (1,2,3,0). 



Remove super-periodic labelings (non-pnmitive 
structures) 



At this point of the algorithm, many of the duplicate 
labelings have been removed from the original k"' list. 
But there are still more duplicates to remove. Some of the 
labelings in the list will represent superstructures that 
are not primitive. In other words, the labelings will be 
super-periodic — they will have periods shorter scale than 
the superlattice.^^ 

The super-periodic duplicates are easily identified be- 
cause they axe identical under at least one permutation. 
The quotient group G dictates a set of permutations un- 
der which the labelings are duplicate. One of these per- 
mutations will leave the labeling unchanged if it is super- 
periodic. For example, continuing the example above, 
three unique labelings are still in the list: aaab, aabb, 
and abab. One of the permutations of the quotient group 
G = Z4 is (2,0,3,1). Under this permutation, the labehng 
abab is unchanged. Thus it is super-periodic, as depicted 
in Fig. [7) It is a duplicate in the sense that the algorithm 
would have already enumerated this structure with the 
index n = 2 structures. 



The previous three steps of the algorithm yield a list of 
distinct labelings for each SNF of index n. Three kinds of 
the duplicate labelings, translation duplicates and label- 
exchange duplicates and super-periodic duplicates, have 
already been removed. One kind of duplicate remains, 
however, and these are eliminated in the current step. 

This step removes labelings which are permuted by the 
rotations of the parent lattice. Whereas the preceding 
steps were applied to generate a list of unique labelings 
for each SNF, the current step must be applied to each 
HNF. In other words, this step must be applied to the 
surviving labelings separately for each superlattice. 

Superlattices which are not fixed by rotations of the 
parent lattice were already eliminated as duplicates in 
step [2] of the algorithm. But rotations which leave the 
superlattice unchanged may still permute the labeling it- 
self. Such permutations are physically equivalent (merely 
rotated with respect to one another). So any two label- 
ings which are equivalent under rotations that fix the su- 
perlattice are duplicate and one must be removed from 
the list. Figure |8] illustrates the situation in two dimen- 
sions. 

Here again, the group theory approach and the SNF 
makes the search extremely efficient. Label rotation du- 
plicates can be identified easily using the properties of 
the quotient group and the SNF transformation. The 
row and column operations required to transform the 
HNF matrix of a superlattice into its SNF can be rep- 
resented by two integer transform matrices, L and K, 
so that LHM == S, where S is the SNF. This step of the 
algorithm is implemented using the left transformation 
matrix L. 

Let G be a 3 X n matrix where each member of the 
quotient group is represented as a column^'^ in G, and 
let M be one of the rotations that fixes the superlattice. 
Then the permutation of the labels (which is the same as 
the permutations of the quotient group) enacted by the 
rotation M is given by: 



LA"^M(LA"^)" 



(3) 
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Determinant size 



100 



FIG. 9: Symmetrically unique HNF matrices as a func- 
tion of the index n (determinant size) for an fee parent lat- 
tice. Asymptotically, the fraction of unique superlattices ap- 
proaches 2/N = 1/24 ~ 5%, where Af = 48 is the number of 
symmetry operations of the fee parent lattice. 



where columns of A are the lattice vectors of the parent 
lattice, and L is the left transformation matrix for the 
SNF. 

The power of this expression is that it allows the label- 
rotation duplicates to be identified by working entirely 
within the quotient group, without requiring any explicit 
reference to the geometry of the super lattice. Thus, as 
in the other steps, duplicates labelings can be eliminated 
in a time proportional only to the number of labelings in 
the list. 



III. EXAMPLES 

In this section, we give several example derivative 
structure lists enumerated by the algorithm. We dis- 
cuss the symmetry reduction of the structure lists and 
then give results for several cases. First, we compare the 
fcc/bcc binary list to that generated by the FWZ algo- 
rithm. We also show the small-unit-cell binary structures 
for a simple-cubic parent lattice and the small-unit-cell 
ternary structures for an fee parent lattice. 
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FIG. 10: (color on-line) Left axis (red): Number of HNF ma- 
trices as a function of determinant size. Right axis (blue): 
Number of inequivalent fee superlattices as a function of vol- 
ume. 



index # of superlattices 

n fcc/bcc sc hex tetragonal 



2 


2 


3 


3 


5 


3 


3 


3 


5 


5 


4 


7 


9 


11 


17 


5 


5 


5 


7 


9 


6 


10 


13 


19 


29 


7 


7 


7 


11 


13 


8 


20 


24 


34 


51 


9 


14 


14 


23 


28 


10 


18 


23 


33 


53 



TABLE IV: Number of symmetrically unique superlattices 
(HNFs) for several different parent lattices as a function of 
the index size n. Note that fcc/bcc are the same as simple 
cubic (sc) only for the odd values of n, and always smaller for 
the even values. Hexagonal (hex) and simple tetragonal par- 
ent lattices have more unique lattices than the cubic systems 
because of their lower symmetry. 



A. Symmetry reduction of superlattices 



In step B of the algorithm, the complete list of HNF 
matrices is reduced to those that are unique under the 
symmetry operations of the parent lattice. Asymptoti- 
cally, the factor by which the list is reduced is one half 
the order of the rotation group of the parent lattice. For 
example, for cubic parent lattices (simple cubic, face- 
centered-, or body-centered-cubic) , the point group con- 
tains 48 rotations (proper and improper). For superlat- 
tices with large index n, the number of HNFs is reduced 
by a factor of 48/2 — 24. Because every lattice is sym- 
metric under inversion, only the proper rotations (i.e., 
not reflections) need be considered in the reduction (thus 
the factor of 1/2) . Figurelolshows the fraction of symmet- 
rically distinct superlattices for determinant sizes up to 



100, while Fig. [TO] shows the actual number of fcc-based 
superlattices compared to the total number of distinct 
HNF matrices. 

For an fee or bee parent lattice (the numbers are the 
same), the number of unique lattices as a function of 
index n (cell size) is equivalent to the Sloane sequence 
A045790.'^ For the sequences generated for other par- 
ent lattices, which accordingly have a different symmetry 
group, there are no known number-theoretic connections. 
Surprisingly, this is even true for the simple cubic lattice. 
For the simple cubic lattice, the sequence is identical to 
the fcc/bcc one for odd values of the index n but larger 
for the even values. (See Table IV ) 



n 


structures 


cumulative 


n 


structures 


cumulative 


2 


2 


2 


13 


2 624 


8 480 


3 


3 


5 


14 


9 628 


18108 


4 


12 


17 


15 


16 584 


34 692 


5 


14 


31 


16 


49 764 


84 456 


6 


50 


81 


17 


42135 


126 591 


7 


52 


133 


18 


212612 


339 203 


8 


229 


362 


19 


174 104 


513 307 


9 


252 


614 


20 


867 893 


1381200 


10 


685 


1299 


21 


1 120 708 


2 501908 


11 


682 


1981 


22 


2 628180 


5130 088 


12 


3 875 


5 856 


23 


3 042 732 


8172 820 



TABLE V: Number of unique fee derivative structures as a 
function of the index n. The second and fifth columns show 
the number of unique structures for each n, while the third 
and sixth columns show the cumulative total. 

n HNF SNF superlattices labelings 

2 7 1 3 3 

3 13 1 3 3 

4 35 2 9 15 



total 55 4 15 21 

TABLE VL Simple cubic superstructures for n < 4. 



n 


ternary 




quaternary 




structures cumulative 


structures cumulative 


3 


3 


3 


- 


- 


4 


13 


16 


7 


7 


5 


23 


39 


9 


16 


6 


130 


169 


110 


126 


7 


197 


366 


211 


337 


8 


1267 


1633 


2110 


2 447 


9 


2 322 


3 955 


5 471 


7918 


10 


9 332 


13 287 


32 362 


40 280 



TABLE VII: Number of ternary and quaternary derivative 
structures for an fee parent lattice. Compare to the num- 
ber of binary structures of Table IV] As the number of labels 
k is increased, the number of derivative structures increases 
rapidly. 



ternary derivative structures. A figure displaying the 
ternary structures for n < 4 is unnecessary — the ternary 
structures have the same unit cell as the binary struc- 
tures, only the labelings are different. For n = 3 the 
labeling aab is replaced by abc. For n = 4 the labelings 
aaab and aabb are replaced by aabc and abac. And, the 
Agds structure has both labelings, rather than one. 



B. Number of structures of different parent lattices 

The number of superstructures increases much faster 
than the number of super/attices as a function of n. In 
general, each superlattice has many different unique la- 
belings. Table [v] shows the number of fcc/bcc derivative 
structures as a function of n. The FWZ begins to un- 
dercount (as expected) at n = 15 but the FWZ count 
is probably sufficient for applications where it was used. 
Our algorithm is formally complete and does not under- 
count. 

Table |VI| lists the number of superlattices and super- 
structures for the simple cubic lattice when n < A. The 
corresponding structures are visualized in Fig. 11 (com- 
pare this to Fig. [2|. There are more simple cubic de- 
rivitive structures than fcc/bcc because there are more 
superlattices for a simple cubic parent lattice than for a 
fcc/bcc parent. 

Similar to the fee case shown in Fig [2] most of the sim- 
ple cubic superstructures can be characterized as stack- 
ings of pure A and B planes. The stacking directions are 
indicated in the figure. In contrast to the fee case, there 
are 6 unique stacking directions. It is interesting to note 
that the three structures that cannot be characterized 
as pure stackings are the only ones corresponding to a 
composite quotient group, namely G = Z2 © Z2. This 
is also true for the non-stacked structures in the fee case 
(Fig. |2|, LI2 and AgPda. For the 'stackable' structures, 
the quotient group is a single cyclic group, Z4. 



Table VII lists the number of fcc/bcc ternary and qua- 



IV. APPLICATIONS 

We give here a few examples of how the list of deriva- 
tive superstructures can be used. An in-depth discussion 
of each example is beyond the scope of this paper. The 
full results of each example will be reported in forthcom- 
ing publications but we summarize our findings here to 
demonstrate how the efficient enumeration of derivative 
structures can be applied to discover new physics or aid 
in materials design. 



A. fee example 

Figure [2] shows small-unit-cell binary superstructures 
for an fee parent lattice enumerated with our algorithm. 
There are 17 unique structures in the set, two for n — 2, 
three for n = 3, and 12 for n = A. Most of the structures 
can be envisioned as stackings, along different crystallo- 
graphic directions, of layers containing a single kind of 
atom. For instance, the n = 2 structure labeled LIq can 
be envisaged as A and B layers alternately stacked in the 
[001] direction. The stacking directions are indicated by 
green planes in the figure. 

Of these 17 small-unit-cell fee superstructures, some 
are well-known structures and others have never been 
observed (blue shading). Some have never been ob- 
served experimentally but are predicted^ to be ther- 
modynamically stable at low temperatures (yellow shad- 
ing). It is intriguing to ask why, among these small. 
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FIG. 11: The first 21 binary 
structures derived from tiie sim- 
ple cubic lattice. All have 4 
atoms/cell or less. Structures 
marked with a crystallographic 
direction (hkl) can be charac- 
terized as a stacking of pure A 
and B atomic layers (black and 
white spheres. For n = 4 all of 
the stacked structures occur in 
pairs, A3B1 and A2B2. The last 
three structures, labeled 19, 20, 
and 21, cannot be characterized 
as pure stackings. These struc- 
tures have basis vectors whose 
corresponding quotient group is 
G = Z2 Z2 (rather than Z4). 
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simple structures, some are observed in Nature and oth- 
ers are not. Recently, a metric of 'bonding randomness' 
was proposecpSl that shows structures with low bonding 
randomness are more likely to be observed in Nature. 

In the case of these 17 fcc-derived superstructures, 
those that have been observed (unshaded) are generally 
'unrandom', those that that have never been observed or 
even predicted to exist (blue shading) have a highly ran- 
dom bonding, and those in between are those that have 
not yet been observed but have been predicted to exist 
(see Fig. 4 in Ref . [TO]) . Noteworthy among these results is 
the case of the structure predicted to exist in Cd-Pt at a 
1:3 stoichiometry (see Fig. [2]). We refer to this structure 
as LI3 in analogy to the Strukturhericht designations for 
related structures. 
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FIG. 12: The fcc-derived superstructure of PtgTi. The struc- 
tures is shown as a face-centered tetragonal structure. The 
dotted lines show the conventional fee cell. Black spheres 
represent Pt, white spheres Ti. This structure has the lowest 
bonding randomness of all 9 atom/cell fee superstructures. 



The first theoretical discovery of the LI3 structure was 
by Miiller in Ag-Pd.l^SlLater, a datamining searctpi' found 
LI3 as a candidate ground state in two binary systems, 
Cd-Pt and Pd-Pt. Subsequently, we constructed clus- 
ter expansions for these two systems so that the en- 
ergy of any configuration could be computed quickly. 
Then, by enumerating all possible configurations, we per- 
formed (practically) exhaustive searches of fcc-derived 
superstructures and verified that this structure is glob- 
ally stable in these two systems. 

Because the LI3 structure had not been observed and 
was not even suspected, the computational discovery of 
LI3 could have never occurred without exhaustively ex- 
amining derivative superstructures as we have done here. 
Without the enumeration, this structure never would 
have been considered in the searches. Our new algo- 
rithm provides a faster, more general way to enumerate 
derivative superstructures. It has turned up a number 
of unsuspected structures with low bonding randomness, 
which are good candidates for new structures that are 
likely to be observed experimentally. 



B. 8:1 stoichiometry example 

In this example we apply the enumeration algorithm 
to structures with a fixed concentration. A structure 
with an unusual stoichiometry of 8:1 was discovered^ 
in Pt-Ti in the late 19 50s (s ee Fig. [l2]). The structure 
was then 're-discovered^22l2ll in Ni-Nb in 1969, and has 
since been observed in about a dozen more Pt/Pd/Ni- 
rich intermetallic systems. This structure has practical 
import because its impact on the alloy. Pure platinum 
and palladium are quite soft but the PtgTi structure can 
significantly strengthen the material, while at the same 
time maintaining the purity of the host material. (Inter- 
national jewelery hallmarking standards require that the 
alloy be at least 95 wt-% pure.) 

It would be useful to know which Pt/Pd-rich com- 
pounds can form stable structures and what the struc- 
tures are. Using the structural enumeration method 
described in this paper, we generated all 9-atom/cell, 
8:1 stoichiometry structures — there are 14 such struc- 
tures. Taking these enumerated structures, we rankedlSl 
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FIG. 13: Depiction of the perovskite structures. B-sites are 
the cube corners, A-sites the cube centers. The oxygen atoms 
(not shown) form an octahedron around the A-sites. 



the structures and find that the PtgTi structure has the 
highest likeHhood among the 14. However, one of the 
other 13 structures has a HkeUhood index nearly as high 
as PtgTi. Therefore it is another candidate for a new 
Pt/Pd/Ni-rich structures. Using first-principles meth- 
ods, combined with the enumeration algorithm, we are 
conducting a broad search for the other high-likelihood 
Pt/Pd-rich structures. So far we have found 9 new binary 
systems that have undiscovered Pt- or Pd-rich ground 
states .l2al 



The atoms on the substitution sites will often order, re- 
sulting in a superstructure larger than the original cubic 
unit cell. Because the ordering also afi^ects the materials 
properties (sometimes drastically), it is helpful to know 
what ordered structures are possible and which are likely. 
This is precisely the information that our algorithm gives. 
The B-sites of the perovskite structure form a simple 
cubic lattice as shown in the figure (big spheres). To 
find all possible orderings of B and B' atoms on the B 
site, we apply our algorithm to the case of a simple cubic 
superstructures. For unit cells with 4 B-atoms or less per 
unit cell, we find 21 possible structures (see Table VI and 
Fig.jTI]). 

of 



A 



nu mber of t hese superstructures have been 
observed .l^I^lISIMinipQj. ^ = 2, all three types have been 

observed. For n = 3, the (111) oriented structure has 
been observed. The only n — A structure that we are 
aware of is the A3B1 structure that we have labeled [21] in 
Fig. [TT] Apparently, only these five structures have been 
observed in B-site mixed perovskites. Whether or not 
there are other structures that are likely awaits further 
exploration, but our algorithm provides a starting point 
for finding complex perovskites with new structures. 



D. Ternary fee structures 



C. Perovskite example 

Because our enumeration algorithm works for any par- 
ent lattice (i.e., it is independent of geometery), we can 
apply it to cases outside the realm of normal alloy ques- 
tions (usually fee- or bcc-based systems). This example 
demonstrates its use in ordering problems where the par- 
ent lattice is simple cubic. There is a large class of oxide 
compounds where the 'configuration question' is based 
on a simple cubic parent lattice. 

These oxide compounds have a number of important 
properties such as ferroelectricity and superconductivity. 
Because of their important scientific and industrial appli- 
cations, there is motivation to optimize their functional 
properties to meet different design needs. The first ap- 
proach to optimizing their properties is to change the 
constituent elements. This approach is limited by the 
number of elements that will form the perovskite struc- 
ture so to further tune the properties, researchers create 
mixtures of two (or more) different perovskites. 

These oxides have the perovskites structure (see 
Fig. ?? ). The formula unit for this structure is ABO3 
where A and B are typically transition metals or late- 
period metals like Pb. The A atoms are octahedrally- 
coordinated by oxygen atoms, and the A-sites form a 
simple cubic lattice. The B-sites also form a simple cubic 
lattice in the interstitial sites of the oxygen octahedra. 

In an attempt to tune the properties of perovskite ma- 
terials, researchers often introduce a third metal. For B- 
site mixing, the formula unit then becomes ABi_2;B^03. 



The two important generalizations of our algorithm 
over the FWZ method are arbitrary parent lattices and 
arbitrary numbers of labels (atom types). The second 
means we can treat cases beyond binaries, fc > 2. In the 
following example we exploit the latter generalization to 
examine ordered ternary intermetallics. 

Ordered binary intermetallics offer advantages over 
conventional disordered alloys for structural applications 
at high temperatures. But they also have disadvan- 
tages. Although they typically have high moduli, melt- 
ing points, and strain hardening rates, they suffer from 
low ductility and brittle failure. One would like to de- 
velop alloys which can retain the advantages but avoid 
the problem of poor ductility. 

Cubic Ll2-like alloys often meet the ductility require- 
ment but unfortunately occur in nickel, cobalt, and plat- 
inum based binary alloys which are too dense for the de- 
sired applications. Aluminum-rich alloys would therefore 
be attractive but for the fact that most binary aluminum 
compounds are not cubic. Thus we need a way to form 
cubic alloys with lightweight constituents. 

One promising avenue is multicomponent alloys (fc > 
2), primarily ternaries. Adding a third component to 
a binary alloy that forms Ll2-like structures such as 
DO22 often induces a cubic phase that is more duc- 
tile. Although much work has pursued this avenue ,'^51311 
there are significant experimental difficulties in identi- 
fying phases, determining sublattice compositions, and 
discerning crystal structure of ordered phases. Compu- 
tational modeling is complementary to the experimental 
effort. 
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Our algorithm/code provides an important component 
to model ing e fforts. Coupled with a ternary cluster 
expansioEpZESl code we recently developed, we can now 
exhaustively explore the space of possible structures in 
ternary intermet allies. The number of ternary fee su- 
perstructures is vastly larger than the number of binary 
structures (compare Table VII and Table lv| , so a sys- 
tematic approach like ours is even more important. The 
enumerated ternary structures would also be useful in a 
datamining approach like that of Ref. [11] 



V. SUMMARY 

We developed an algorithm for enumerating derivative 
structures. The algorithm first generates all unique su- 
perlattices by enumerating all Hermite normal form ma- 
trices and using the symmetry operations of the parent 
lattice to eliminate rotationally equivalent super lattices. 
Next, the algorithm generates all possible atomic configu- 
rations (labelings) of each superlattice and eliminates du- 
plicates using a group-theoretical approach rather than 
geometric analysis. 

The algorithm is exceptionally efficient due to the use 
of (i) perfect, minimal hash tables and (ii) a group- 
theoretical approach to eliminating duplicate structures. 
These two features result in a linearly scaling algorithm 
that is orders of magnitude faster than the previous 
method. Moreover, the method can be applied to any 
parent lattice and to arbitrary fc-nary systems (binary, 
ternary, quaternary, etc.). The method is formally com- 
plete (does not undercount) and key parts of the algo- 
rithm (and its implementation) can be rigorously checked 
by number theory results and Burnside's lemma. 

We presented results for the number of superlattices 
and derivative structures for several different parent lat- 
tices as well as binary, ternary and quaternary systems. 
Additionally, applications of the algorithm to alloys were 
demonstrated. We coupled the results with cluster ex- 
pansions and the geometric approach of Ref.l^ and (i) 
provided evidence of an altogether new intermetallic 
structure, LI3, in Cd-Pt and Pd-Pt, (ii) found the novel 
PtgTi to be the most likely structure for 8:1 fee deriva- 
tive structures and uncovered another possible new pro- 
totype, (iii) enumerated small-unit-cell structures of per- 
ovskite alloys, providing possible new structures for ma- 
terials design, (iv) demonstrated how the method can be 
applied to aid ternary alloy design. 
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VIII. APPENDIX 

Hermite Normal Form: If L is a lattice, with basis 
given by the columns of a square matrix A, and S* is a 
superlattice, then S will have basis AM where M is a 
square matrix of integers. Furthermore, all bases of S 
will have the form AMN where N is an integer matrix 
with determinant ±1. Thus, to find a canonical basis for 
S, we may use elementary integer column operations on 
M to make it lower triangular, with positive entries down 
the diagonal. Furthermore, we can arrange that the lower 
triangular matrix H = MN have the property that every 
off-diagonal element is less than the diagonal element in 
its row. Such a matrix EI is said to be in Hermite Normal 
Form, and is unique with respect to the matrix M. 

Thus, if the determinant of M is n, then the number 
of superlattices S of L with index n is equal to the num- 
ber of distinct HNF matrices with determinant n. In 3 
dimensions, that number is 



Y,da{d) = l[\ 



(p: 



e.+2 



i)(pr^ -1) 



(p, - l)2(p, + 1) 



d\n 



where n — Y[pT i^ ^^^ prime factorization of n. 

Smith Normal Form: Using elementary integer row 
and column operations (adding or subtracting an inte- 
ger multiple of one row or column to another, multiply- 
ing a row or column by ±1, or exchanging two rows or 
columns), we may reduce the integer matrix M to a di- 
agonal matrix D with the following properties: 

(i) Each diagonal entry of D divides the next one down. 

(ii) The product of the diagonal entries of D is the 
absolute value of the determinant of M. 
This is called the Smith Normal Form of M. In the lattice 
case, where L is a lattice with basis A, and S* is a super- 
lattice (subgroup) with basis AM, then D describes the 
quotient group L/S as a direct sum of cyclic groups. The 
diagonal entries of D are the orders of the cyclic direct 
summands of the quotient group (as in the Fundamental 
Theorem of Finite Abelian Groups). For example, us- 

^Dii 
ing the notation Z„ = Z/nZ, if D = | D22 

D33, 
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FIG. 14: Two-dimensional superlattices of index n = 8. The 
first and third share the same SNF while the second has the 
trivial SNF, implying a purely cyclic quotient group. In con- 
trast to the first and third superlattices, the lattice points in 
layers parallel to the superlattice edge (dotted line) must all 
have the same label. 



then 



L/ S = G — Zd^j 



A simple, two dimensional example, may help to show 
how this affects our lattice labeling problem. Consider 
the three matrices (all in HNF form) 






duces to SNF matrix 



corresponding to the cyclic 



The matrices Hq and IHI2 both reduce to the SNF matrix 

, which corresponds to a quotient group which is 

Abelian, but not cyclic, but the middle matrix. Hi, re- 
\ 0' 

group of order 8. Thus, if we take A to be the identity 
matrix, so L = Z^, and let Si be the lattice with basis 
Mi, then L/Sq and L/S2 are each isomorphic to the group 
Z2 ® Z4, while L/Si is isomorphic to the cyclic group of 
order 8. 

The fact that the latter is cyclic means we can layer 
the parent lattice in such a way that each parallel layer 
consists of points which all must get the same label (see 
Fig ~ 



14 1. We can arbitrarily label each layer passing 



X for some 




through the interior points of the basis parallelogram. 



and label the rest of the lattice cyclically, as if labeling a 
one-dimensional lattice. The quotient group for Sq or S2 
is not cyclic but can just as easily be used to determine 
equivalent labelings. 

In general, SNF provides a natural homomorphism 
from the parent lattice L onto the direct-sum group 
G = Zdii ® Zd22 ffi Z_D33, with kernel S. By the First 
Homomorphism Theorem, it effectively gives an isomor- 
phism from L/S to G. Since we do only elementary inte- 
ger row and column operations, we may write D = PMQ, 
where the transition matrices P and Q are integer matri- 
ces with determinant ±1. Note that AMQ is another ba- 
sis for S, so an element x G S iS MQz — A^^ 
integer column vector z, which is true iff Dz — 
So the map 



VA^^x mod 



(meaning that each row of the resulting column matrix 
is reduced modulo the corresponding diagonal element of 
D) maps from L into the direct sum group G, with its 
kernel being the superlattice S. 

As for computing the SNF of a matrix, there are spe- 
cial algorithms designed to compute it efficiently when 
M is very large but the simplest algorithm, effective for 
small (e.g., 3x3) matrices, is basically an extension of 
Euclid's algorithm for finding the greatest common divi- 
sor of two numbers. One subtracts multiples of elements 
in the matrix from other elements in the same row or col- 
umn (using column or row operations respectively) until 
the greatest common divisor of all the elements of M is 
exposed. That element is then moved to the upper left 
corner of the matrix and used to zero out all other ele- 
ments in the first row and in the first column. Then one 
applies the same algorithm to the 2 by 2 submatrix in the 
lower right. Thus, in particular, the upper left entry in 
D is always the greatest common divisor of all the entries 
inM. 

Note that the number of 3 x 3 SNF matrices with de- 
terminant n is given by J^^ ^^3(6^), where n = Hi Pi' (^^^ 
prime factorization) and Psi^k) is the number of parti- 
tions of an integer k using at most 3 summands]^ 
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